Multimodal communication error detection for driver-car interaction

نویسندگان

  • Sy Bor Wang
  • David Demirdjian
  • Trevor Darrell
  • Hedvig Kjellström
چکیده

Speech recognition systems are now used in a wide variety of domains. They have recently been introduced in cars for hand-free control of radio, cell-phone and navigation applications. However, due to the ambient noise in the car recognition errors are relatively frequent. This paper tackles the problem of detecting when such recognition errors occur from the driver’s reaction. Automatic detection of communication errors in dialoguebased systems has been explored extensively in the speech community. The detection is most often based on prosody cues such as intensity and pitch. However, recent perceptual studies indicate that the detection can be improved significantly if both acoustic and visual modalities are taken into account. To this end, we present a framework for automatic audio-visual detection of communication errors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards a Multimodal Interface for In-Car Communication Systems

The number of cars provided with an in-car communication system has considerably increased during the past few years. Using a mobile phone whilst driving is a safety-critical task and can cause usability issues. Speech modality has been incorporated in order to allocate hands and eyes solely to the driving task speech. This paper discusses an investigation into in-car communication systems and ...

متن کامل

A Multimodal Virtual Co-driver’s Problems with the Driver

The paper discusses a series of four user-oriented design analysis problems in a research prototype multimodal spoken language dialogue system for supporting drivers whilst driving. The problems are: (a) when should the system (not) listen to the speech and non-speech acoustics in the car; (b) how to use the in-car display in conjunction with spoken driver-system dialogue; (c) how to identify t...

متن کامل

Development of a Generic Multimodal Framework for Handling Error Patterns during Human-Machine Interaction

In this contribution, we present a generic and therefore easily scalable multimodal framework for error robust processing of user interactions in various domains. The system provides a generic kernel for evaluating user inputs and additional pieces of information from situational, personal, and functional context. After an initial domain-specific configuration, the system is capable of detectin...

متن کامل

Enhancing the Usability of Multimodal Virtual Co- drivers

This chapter discusses a series of four user-oriented design analysis problems in a research prototype multimodal spoken language dialogue system for supporting drivers whilst driving. The problems are: (a) when should the system (not) listen to the speech and non-speech acoustics in the car; (b) how to make use of the in-car display in conjunction with spoken driver-system dialogue; (c) how to...

متن کامل

The Effects of Modality, Urgency and Message Content on Responses to Multimodal Driver Displays

This work investigates the design and use of multimodal displays for the car. Driver cues that vary in urgency as well as message content and use the audio, tactile and visual modalities in all their unimodal and multimodal combinations have been designed and evaluated. The goal is to investigate how such displays can effectively alert drivers without distracting. This will form the basis for c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007